Job Replication on Multiserver Systems

Authors

  • YUSIK KIM
  • RHONDA RIGHTER
  • RONALD WOLFF
Abstract

Parallel processing is a way to use resources efficiently by processing several jobs simultaneously on different servers. In a well-controlled environment where the status of the servers and the jobs is well known, everything is nearly deterministic and replicating jobs on different servers is obviously a waste of resources. However, in a poorly controlled environment where the servers are unreliable and/or their capacity is highly variable, it is desirable to design a system that is robust in the sense that it is not affected by poorly performing servers. By replicating jobs and assigning them to several different servers simultaneously, we not only achieve robustness but can also, under certain conditions, make the system more efficient, so that jobs are processed at a faster rate overall. In this paper we consider the option of replicating jobs and study how different ‘degrees’ of replication, ranging from no replication to full replication, affect the performance of a system of parallel servers.
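The trade-off described in the abstract can be illustrated with a small calculation. This is only a sketch under assumptions that are not taken from the paper: the system is saturated (a job is always waiting), each job runs on d servers at once, the d copies have independent service times, the remaining copies are cancelled at no cost as soon as one finishes, and a service time is either `fast` (with probability `p_fast`) or `slow`. The function names and the two-point distribution are hypothetical and chosen purely for illustration.

```python
def expected_min_two_point(fast, slow, p_fast, d):
    """Mean of the minimum of d independent two-point service times.

    The minimum equals `slow` only if all d copies are slow;
    otherwise at least one copy is fast and the minimum is `fast`.
    """
    p_all_slow = (1.0 - p_fast) ** d
    return fast * (1.0 - p_all_slow) + slow * p_all_slow


def throughput_per_server(fast, slow, p_fast, d):
    """Jobs completed per unit time per server when each job uses d servers.

    A group of d servers completes one job every E[min] time units,
    so each server contributes 1 / (d * E[min]) jobs per unit time.
    """
    return 1.0 / (d * expected_min_two_point(fast, slow, p_fast, d))


if __name__ == "__main__":
    # Highly variable servers: usually 1 time unit, occasionally 100.
    for d in (1, 2, 4):
        print("variable", d, throughput_per_server(1.0, 100.0, 0.9, d))
    # Deterministic servers: every copy takes exactly 5 time units.
    for d in (1, 2, 4):
        print("deterministic", d, throughput_per_server(5.0, 5.0, 0.9, d))
```

Under these assumptions, replication raises per-server throughput when service times are highly variable and lowers it when they are deterministic, which matches the intuition in the abstract that replication is wasteful in a well-controlled environment but can help in a poorly controlled one.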

Similar resources

Analysis of Multi-server Systems via Dimensionality Reduction of Markov Chains

The performance analysis of multiserver systems is notoriously hard, especially when the system involves resource sharing or prioritization. We provide two new analytical tools for the performance analysis of multiserver systems: moment matching algorithms and dimensionality reduction of Markov chains (DR). Moment matching algorithms allow us to approximate a general distribution ...

A Fault-Tolerant Load-Balancing Protocol for Distributed Multiserver Queuing Systems

A simple and efficient scheme to achieve fault tolerance in a new load-balancing protocol for a distributed multiserver queuing system is proposed. It is assumed that the distributed queuing system consists of a job producer and a number of independent servers, or workers, who compete for produced jobs. All communications between the job producer and workers in the underlying network are base...

Data Replication-Based Scheduling in Cloud Computing Environment

Abstract— High-performance computing and vast storage are two key factors required for executing data-intensive applications. In comparison with traditional distributed systems like data grid, cloud computing provides these factors in a more affordable, scalable and elastic platform. Furthermore, accessing data files is critical for performing such applications. Sometimes accessing data becomes...

Scheduling for Today’s Computer Systems: Bridging Theory and Practice

Scheduling is a fundamental technique for improving performance in computer systems. From web servers to routers to operating systems, how the bottleneck device is scheduled has an enormous impact on the performance of the system as a whole. Given the immense literature studying scheduling, it is easy to think that we already understand enough about scheduling. But, modern computer system desig...

Stochastic Analysis and Optimization of Multiserver Systems

Motivated by emerging trends and applications such as autonomic computing, this paper presents an overview of some research in the stochastic analysis and optimization of multiserver systems. Our primary focus is on multiserver systems in general, since this research provides the mathematical methods and results that have been and will continue to be used for the stochastic analysis and/or opti...

Publication date: 2009